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Testing for a Monotone Trend in a Modulated 
Renewal Process 

P. A. W. Lewis* and D. W. Robinson* 

Abstract , In examining point processes which are overdispersed with re- 
spect to a Poisson process, there is a problem of discriminating between 
trends and the appearance in data of sequences of very long intervals. In 
this case the standard "robust" methods for trend analysis based on log trans- 
forms and regression techniques perform very poorly, and the standard exact 
test for a monotone trend derived for modulated Poisson processes is not ro- 
bust with respect to its distribution theory when the underlying process is 
non-Poisson. However, experience with data and an examination of the depar- 
tures from the Poisson distribution theory suggest a modification to the 
standard test for trend, both for modulated renewal and general point process- 
es. The utility of the modified test statistic is verified by examining 
several sets of data, and simulation results are given for the distribution of 
the test statistic for several renewal processes. 

1. Introduction . Stochastic point processes or series of events can be 
described either through the sequence of times to events {T^}, or through the 
counting process {n }, where N is the number of events occurring in (0,tj. 
Trends on both serial number 1 and on time t are possible, but we only consid- 
er the time trends here, nor do we consider grouped data. 

A fairly complete description of trend analysis for Poisson point proc- 
esses is given in Cox and Lewis [^] , Lewis [ll], Lewis [10] and Brown [2]. 

In these works there is another minor difference which complicates matters; 
this is that observation may be for a fixed time interval (0,to] or for a 
fixed number n of events. Fixed time observation is more common in practice 
but the fixed number case is easier to simulate, so we consider both, depending 
on convenience. Except for messy details the results are essentially the same. 

We will also consider only the case of a simple monotone trend in time for 
the process, extending the Poisson theory to the case of more general point 

*Naval Postgraduate School, Monterey, California. This research was supported 
by the Office of Naval Research through grant NR0^2-28^. 
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processes. In the case of a non-homogeneous or modulated Poisson process a 
simple model pp . ^ 5 ] for the rate X(t), 

(1) X(t) = exp{a+3t} = Xexp{St}, t>0, A>0, 

leads to a uniformly most powerful conditional test for 3=0 against based 

on the statistic 



The conditioning is on N. , the observed number of events in (O^t^], since N 

to tg 

is a sufficient statistic for the nuisance parameter a for all 3. Conditional- 
ly the statistic has mean N /2 and variance N, ^/12, so the statistic 

^0 ^0 



(2) 



I(T^/to ) - ^ 
(n/12)''^ 



which converges rapidly to a unit normal variable under the null hypothesis, is 
used to test for 3=0. The hypothesis is rejected for large or small values 
of U. 

The test statistic U is computed in the SASE IV program for the analysis 
of point processes [I 3 ] and the program stops if |u|>l,96, since subsequent 
analysis in the program is for stationary processes. However, most users by- 
pass this stop because it almost always occurs. This has led to the present 
work, the supposition being that the distribution theory of U is very sensitive 
to the Poisson hypothesis. Two sets of data which lead to this program stop 
are discussed in the next section. Then other possible test statistics are 
discussed Section 3, and the distribution of a statistic similar to J]t^ is 
examined for the special case of a Gamma renewal process. This leads to a 
simple modification of the test statistic to account for the overdispersion of 
the intervals between events relative to the exponential distribution. 
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In subsequent sections simulation results for the null distribution of the 
statistic are given for other renewal processes. Then the modification of the 
test which is required for general point processes is discussed. It is the 
simplicity of the extension in this general case which makes the test statistic 
attractive when compared to other possibilities. The problem of the power of 
different tests for trend has not been considered. 

Finally we note that the situation we are interested in is that in which 
the point process is overdispersed with respect to the Poisson process. This 
will be defined to be the situation in which the index of dispersion for counts 
, pp. 71] , 

var Tn ] 

I = lim J(t) = r^- , 

E(N^j 

is greater than one, its value for the Poisson process. For the most part this 
corresponds to the marginal distribution of times between events having a coef- 
ficient of variation 



C(x) 



o(x) 

eTTT 



greater than 1. This is always true for renewal processes, and for cluster 
processes (see [12] and [8]). 

2 . Data Analysis . Two sets of data are examined here and the results of 
tests for trend based on U are discussed. 

Statistics for the first set are tabulated in Table 1. This set consists 
of 3 sequences of page exceptions in a multiprogrammed two-level memory comput- 
er with demand paging [14]. There is no particular compelling reason to expect 
a monotone trend in the data, except for an initial transient. This transient 
occurs because no page exception can occur until the memory is filled to the 
exception levels, which are ?6, 197, and 512 in the three sequences examined. 
The transient is almost negligible at level 76, where the test based on U 
(column 4) rejects homogeneity at a 1 % level. The rejection is stronger for 
the other levels, and at exception level 512 there is a very long transient and 
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therefore inhomogeneity . 

Note however that the intervals between events are very skewed with res- 
pect to the exponential distribution, the coefficients of variation given in 
column 5 being on the order of 3 , compared to 1 for an exponentially distrib- 
uted variate, and the coefficients of skewness given in column 6 of Table 1 
being greater than the value for the exponential distribution. 

An even more striking failure for the test occurs in the second set of 
data explored in Table 2 . The events are occurrences of earthquakes with 
energies greater than 4.0 on the Richter scale in California and Nevada from 
1932 to 1969* Six sections with equal numbers of events (except for the last) 
were analyzed and their statistics are given on the first six rows of Table 2 . 
Columns 5 to 7 show that the intervals are very skewed, and the estimated 
serial correlation coefficients in column 8 show the intervals to be 
correlated . 

There is no particular reason to expect a monotone trend in this data, 
but |u| is greater than I.96 for all sections. The average of the U values is 
-0.72 and the estimate of the standard deviation of U for the sections (the 
sample standard deviation of the 6 U*s) is shown in row 9 , column 4 to be 
5=7.82. This is far in excess of the value of o=l for the U statistic under 
the hypothesis of a homogeneous Poisson process. 

We will return to this data later on. 

3. General remarks on the test statistic . Neither of the series consid- 
ered above can be modelled as a renewal process since the estimated first 
serial correlation coefficients are large. In fact the first set has been 
modelled as a univariate semi-Markov process by Lewis and Shedler [l 4 ] and the 
earthquake data is well known to be some kind of cluster process (Lewis, [12]; 
Vere-Jones [I8]). 

It is useful to consider renewal situations however, even if they occur 
rarely in practice, because of analytical possibilities. Cox [3] has extended 
the model ( 1 ) to modulated renewal processes by defining the intensity function 
A(t) as 

(3) A(t) = z(u(t)) exp{a+ 6 t} , 
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Table 1 . Page exceptions in a multlprograjmned tv/o-level memory computer with 



demand paging 



Level 

pages) 


*^t 

^0 


to (page 
references ) 


u 


C(x) 


ys 


ys 

Pi 


u 

{C(x)l 


76 


1,807 


8 , 802,464 


-2.83 


3.34 


10.34 


+ 0.188 


-0.85 


197 


820 


8 , 802,464 


-8.67 


3.27 


7.14 


+0.177 


-2.60 


512 


517 


8 , 802,464 


-18.11 


3.70 


6.87 


+0.130 


-4.90 



Table 2 . 


Earthquake Data - 


All earthquakes 


with energies greater than 


4.0 in 




California and Nevada; 1932 - 


-1969 










Section 


'^t 


tjj (hours ) 


U 


C(x) 


Yi 


2 


ys 

Pi 


U 




to 












ia(x)i 


1 


468 


72,200 


4.4 


1.8 


5.50 


42.9 


+0.49 


2.44 


2 


468 


58,921 


-6.7 


1.65 


3.67 


22.4 


+0.16 


- 4.06 


3 


468 


*< 9,733 


9.9 


1.70 


2.80 


12.8 


+0.22 


5.82 


4 


468 


29,403 


2.1 


1.70 


3.30 


17.5 


+ 0.14 


1.23 


5 


468 


48,061 


-11.7 


1.50 


2.40 


9.8 


+0 . 34 


-7.80 


6 


^^31 


79 ,686 


-2.3 


1.25 


2.40 


12.6 


+ 0.12 


- 1.84 


Average 






-0.72 


1.6 


3.01 


19.67 


0.245 


-.702 


/s 

S- 

X 






( 3 . 19 ) 


( 0 . 81 ) 


( 0 . 68 ) 


( 4 . 99 ) 


( 0 . 059 ) 




✓V 

0 






7.82 


0.197 


1.67 


12.22 


( 0 . 144 ) 






TOTAL 


2771 


338,004 


-0.527 


1.63 


— 


— 


- 


-0.323 



Record 
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where z(-) is the hazard pp . 135] or hazard rate in the terminology 

of some workers in reliability theory. However, although a complete likelihood 
can be set up [3] it has not been possible to derive any explicit tests for 
6=0 from it. 

We therefore continue to examine modifications of the U statistic. For 
convenience, however, we consider the case of observation for a fixed number of 
events n. There are several reasons for this: 

(i) The fixed number case is much simpler to simulate and statistical dif- 
ferences between the two situations will be minor, especially for large 
samples . 



( 11 ) 


The 


sufficient 




n 






= 1 X 
i = l 


. , where X, 
1 ^ i 




P- 


52 ] is 






n 


(4) 




Y, = y S 
2n 



(5) 



n 



= I (n+l-l)X, 
i=l ^ 



Although this statistic can be considered conditionally on it follows 

from well known characterizing results for exponential and Gamma distributed 
variates (see Lukacs and Laha [ 15 ] ) that this is equivalent to considering the 
test statistic 



( 6 ) 






V . = 1-1 

I Xj 
1 = 1 



Moreover for any renewal model with intensity function (3) this statistic 
will be free of the nuisance parameter a for any 6, as can be easily shown. 
This is an important simplification. 

(iii) Analytical results for the fixed number case are simpler to obtain than 
those for the fixed time case. Moreover (6) suggests several other possibili- 
ties. From the form (5) for the numerator it can be seen that it is like an 
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empirical serial correlation between the natural numbers and the serially 
ordered times between events . This is the form of several standard tests 
for trend [ 7 , Ch. 45]. A possibility would be to replace the X^*s by exponen- 
tial scores and correlate the serially ordered scores with the index numbers i. 
Permutation tests of this sort have been discussed by Guillier [ 6 ] ; v;e do not 
pursue them here because they depend on the independence assumption in the re- 
newal hypothesis and we wish to consider more general point processes with 
dependent t imes-between-event s . 

Two other possible tests for trend are noted here. 

One is based on log transformations of the data and standard regression 
techniques, but as noted in Cox and Lewis [4, pp. 4i] these methods are likely 
to have poor relative power for intervals which are more dispersed than 
exponential variates. (For fairly regular processes they are likely to be the 
favored procedures.) 

The second possibility arises from an analogy between and goodness of 
fit tests. Define 



(7) 



'n,i 



n 

I Xj 



1 = 1 , .... (n-l). 



Then if denotes [17 ] the empirical cumulative distribution 

function for •••> (n-l), we have 

1 ^ 

( 8 ) / {F^(u) - u}du = (n+ 1 ) - . 



Thus Y is essentially a one sided Cramer-von Mises statistic and other norms 
n 

could be tried to measure the deviation of F (u) from the function u between 

n 

0 and 1 . 

Because the statistic Y and tests for trend based on it can be extended 

n 

to non-renewal processes, we consider its distribution first for Gamma renewal 
processes, then for several other renewal processes and then for cluster 
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processes . 



Testing In modulated Gamma renewal processes . The Gamma renewal 
process has independently distributed intervals with probability density func- 
tion [^, pp . 136] 



( 9 ) 



r^(x) 



fkl“ 

tuj r (k) 



x> 0 , k> 0 . 



where f (k) is the complete Gamma function. For k=l we have an exponentially 
distributed variate, and for k=k the square of a normal random variable. We 
will be concerned with the case k<l. We also have 



( 10 ) 



E(X) 



y; var (X) 




C(X) = . 

/IT 



Consider now the distribution of given by ( 6 ), which we write for 
convenience as 



( 11 ) 



Y 



n 



n 

I (n+l-i)X./n y, 

1 = 1 ^ 12n 

P y ’ 

^ X /n ^In 

i = l ^ 



The moments of the numerator and denominator are 

( 12 ) e(y;^) = y, var (y;^) = oVn, 

(13) ^(^ 2 n] “ (n+l)y/ 2 , var ^^ 2 n] " (n+ 1 ) ( 2 n+l)o^/( 6 n) . 

Now it is a characterizing property of Gamma distributed variates 
[15, p. 58] that the expected value of ratios of linear functions of the 
Gamma variates such as those appearing in ( 11 ) is the expected value of the 
ratio of the expectations. Thus we have, for Gamma renewal processes. 
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(14) 



(n+l)/2; 



E 




(15) 



var (Yj 



(n-1) (n+1) _ (n-1) (n+1) 

12 TkK?n- 12 („/c = (x)*l) 



(16) var [Y 1 - c’(x). 



Since C^(x) equals one for a Poisson process (k=l), this checks with re- 
sults for the statistic U given in (2). 

Note further that 

n 

1 

y' = Y ‘T*'l _ 1 = 1 
n n ■ 2 n 

I X./n 

1»1 ^ 



f n+1 1 

i 2n ■ n 



(17) 



f X 


- X , ,1 


f n+1 


- —1 


i 1 


n+l-lj 


i n 


nj 



n 

I X /n 
1=1 ^ 



( 18 ) 



I X>, 
1=1 ^ ^ 
n 

I Xa/n 
1=1 ^ 
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where j^j is the greatest integer less than or equal to n/2 , ^n+l-i 

is a symmetric random variable and a^ is an odd sequence. 

Using (18) we can show the following results: 

(1) The centered statistic Y* has odd moments which are all zero. This fol- 

n 

lows because the numerator in (18) is a sum of independent symmetric random 
variables and is therefore [5, Lemma 2, p. 1^9] Itself symmetric. 

This implies that the odd moments of the numerator (including the first) are 
zero and by the Lukacs and Laha result cited above, so are those of y\ 
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I'nus is a symmetric random variable. 

(ii) The numerator in (l8) divided by (n) ^ is asymptotically normal. Moreover 
since the denominator converges with probability one to y, which is non-zero, 
results from Billingsley [1, Corollary 2, p. 31] show that the reciprocal of 
the denominator converges with probability one to 1/y . Slutsky’s Theorem (see 
Billingsley [1] ) then says that 



(19) 



12 

[n-lj 

cTxy 



n 



N(0,1). 



(iii) Convergence to the normal distribution is likely to be very rapid be- 
cause of the symmetry of the distribution of Y^. 

To examine the small sample distribution of for the Gamma renewal case 
an extensive simulation was undertaken. Detailed results are given in Robinson 
[16J . The results are Illustrated in Table 3, which is extracted from 
Robinson [l6] . 

The simulations involved 100,000 replications using the random number 
generator LLRANDOM (Learmonth and Lewis [9] ) and a Gamma random number gener- 
ator developed by Robinson [l6] . The computations were checked by comparing 
the theoretical results for the mean and variance of the statistics with the 
simulated mean and variance. 

Only the case k=0.1 (c^(X)=10) is given in Table 3 because this was the 
most extreme case simulated and has the greatest departure from normality and 
the slowest convergence to the asymptotic normal form. Simulated quantiles of 
Y^, normalized by subtracting the mean (l4) and dividing by the square root of 

the variance (15) (these are listed in the last two rows of the table) are 

shown in Table 3. Because of the symmetry of the distribution, only the lower 

quantiles corresponding to levels a=0.001, 0.002, 0.005, 0.010, 0.020, 0.025, 

0.050, 0.100, 0.200, 0.300, 0.^00, 0.500 are given. They are actually the 
average of the simulated upper and lower quantiles and have a standard devia- 
tion of approximately 0.001. 

The distribution can be seen to be a little more peaked than a normal dis- 
tribution, with shorter tails, but even by n=50 a normal approximation to the 
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Table 3 . Simulation results for the statistic for Gamma distributed inter- 
val with k= 0.10 under the null hypothesis of no trend ( 6 = 0 ). 

Quantiles of are normalized by subtracting E(Yt) and dividing by 
n j 

0 (Yj). 



a 


n =10 


n=30 


n =50 


n =100 


Norml quantile 


0.001 


- 2.202 


-2.740 


-2.915 


-3.001 


-3.090 


0.002 


-2.191 


- 2.607 


-2.750 


-2.812 


-2.878 


0.005 


- 2.148 


- 2.460 


-2.500 


-2.545 


-2.576 


0.010 


-2.078 


-2.231 


-2.290 


-2.313 


-2.326 


0.020 


-1.944 


- 2.014 


- 2.049 


-2.054 


-2.054 


0.025 


-1.875 


-1.935 


-1.960 


-1.965 


- 1.960 


0.050 


-1.654 


-1.665 


-1.665 


-1.656 


- 1.645 


0.100 


-1.343 


-1.320 


-1.307 


-1.297 


-1.282 


0.200 


-0.924 


-0.881 


-0.871 


-0.856 


- 0.842 


0.300 


-0.591 


-0.554 


-0.549 


-0.537 


- 0.524 


0.400 


-0.279 


-0.272 


-0.265 


-0.261 


- 0.253 


0.500 


- 0.001 


-0.005 


0.002 


-0.003 


0.000 


E(Y„) 


5.5 


15.5 


25.5 


50.5 




c(Yn> 


2.031 


4.328 


5.891 


8.703 
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distribution of is adequate for purposes of hypothesis 
The proposal for testing a monotone trend in a Gamma 
derived from these results is to estimate the coefficient 
the data and test for B=0 using 



{C(X)} 

and assuming that its distribution is that of a unit normal distribution. This 
essentially uses the Poisson test statistic divided by C(X). This modified 
statistic is given in the last columns of Tables 1 and 2, The test results 
are more in line with expectations, but still do not reflect inflation of the 
variance of U because of correlation between Intervals between events. This is 
discussed in Section 6. 

5. Distributional results for other renewal cases . The result (14) holds 
for any stationary sequence X^, ..., X^, including a renewal (l.i.d.) sequence. 
This is because 



testing, 
renewal process 
of variation from 





+ . . . 


+ 


X 

n 


= nE 


^1 1 


X, 


+ . . . 


+ 


X 


X, + ... + X 








n^ 




1 nj 



orE{x^/^X^ + ... + n Taking expectations in (6) and 

using the form (5) for ^2n yields 




This result merely says that Y^, which is a normalized centroid of times to 
events in an interval stationary point process, always has the expected value 
(n+l)/2. 

Thus the centering in (17) is correct for all sequences and we discuss 

y’ from here on. 
n 

Another useful result is that Y^ is a symmetric random variable for any 
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renewal sequence. To see' this note that -Y^ can be written exactly In the 

form (l8) with ~ > but since these are symmetric random variables 

and the X 's are Independent, the functional form for -y' Is exactly the same 
± n 

as that for y\ Thus they have the same distribution and thus Y^ Is symmetri- 
cal random variable. All odd moments are thus zero. In addition by arguments 
of the previous section, Y^ is asymptotically normal with variance (l6) If 
var (X)<«> for any renewal process. 

\ 

To explore the small sample distribution of Y further for renewal cro- 

n 

cesses using simulation we chose two other density functions for the Intervals. 
The first Is the Welbull density function 

(20) fj^(x) = ke^x^^'^expC-B^x^) 6>0, k>0, x>0 

which reduces to the exponential for k=l. In the simulation the parameters 
were chosen so that the means and coefficients of variations of the intervals 
X were the same as for the Gamma cases. 

The second density function chosen was the log-normal density, again with 
parameters chosen to match the means and coefficients of variations in the 
Gamma cases. Note that both these densities are, for given coefficient of 
variation, more skewed than the Gamma density, the log-normal more so than the 
Weibull. In addition both have hazard functions which approach zero as 
in contrast to the Gamma density which has an exponential tall. 

I 

It is possible to compute var(Y^) for finite n in both these cases, but 
the results are messy. In general the variances are smaller than for the 
Gamma case; simulation results give, when C^(X) =10.0 and n = 50, values of 
5 . 891 , 5.182 and 4.355 for the Gamma, Welbull and log-normal cases 
respectively . 

Only the worst case of the simulations for the Welbull and log-normal in- 
tervals, l.e., those matching the Gamma case with C^(X) = 10.0 are given, in 
Table 4 and 5 respectively. Again 100,000 replications v/ere used. 

The normalized quantiles show distributions for y' at n = 10, 30, 50, 100 

n 

for both densities and, in addition, for n = 200 for the log-normal case. In 
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Table 



Simulation results for the statistic for Weibull distributed in 

tervals with C^(x) = 10.0 under the null hypothesis of no trend 

(B=0). Quantiles of Y are normalized by subtracting E(Y ) and 

n n 

dividing by . 

Normal 

n = 10 n = 30 n = 50 n = 100 Quantile 



001 


-2.533 


-2.922 


002 


-2,473 


-2.772 


005 


-2.343 


-2.521 


010 


-2.188 


-2.301 


020 


-1.987 


-2.042 


025 


-1.920 


-1.954 


050 


-1.659 


-1.652 


100 


-1.324 


-1.294 


200 


-0,883 


-0.850 


300 


-0.557 


-0.531 


400 


-0.271 


-0.255 


500 


-0,002 


-0.000 



Y 

n 


5.500 


15.490 


(’'n) 


1.678 


3.703 


■W 


-0,002 


-0.001 




2.52 


2.86 



-3. 067 


-3.214 


-3.090 


-2.845 


-2.973 


-2.878 


-2,570 


-2.635 


-2.576 


-2.326 


-2.373 


-2.326 


-2.052 


-2.069 


-2,054 


-1.960 


-1.971 


-1,960 


-1.644 


-1.641 


-1.645 


-1.280 


-1.272 


-1.282 


-0.845 


-0.831 


-0 , 842 


-0.528 


-0.516 


-0.524 


-0.259 


-0.249 


-0.253 


0.000 


-0.002 


0,000 


25.527 


50.495 




5.182 


7.953 




-0.001 


-0.001 




2.96 


3.14 








W t 



Table 5 . 



Simulation results for the statistic Y for loe-normal distributed 

n — “ 

Intervals with C^(x) = 10.0 under the null hypothesis of no trend 



.001 
.002 
.005 
.010 
.020 
.025 
.050 
.100 
.200 
. 300 
. 400 
. 500 

(^n) 



( 3 = 0 ). Quantiles of Y are normalized by subtracting E(Y ) and 



dividing by 



n = 10 


n = 30 


n = 50 


- 2.941 


-3.342 


-3.452 


- 2.805 


- 3.084 


-3.167 


- 2.550 


-2.725 


- 2.775 


- 2.325 


-2.434 


-2.445 


- 2.073 


- 2.104 


-2.098 


- 1.975 


-1.997 


-1.991 


-1.656 


-1.638 


-1.629 


-1.289 


-1.252 


- 1.246 


- 0.843 


-0.813 


- 0.804 


- 0.524 


-0.502 


-0.495 


-0.255 


- 0.241 


-0.236 


0.001 


0.000 


0.003 


5.501 


15.484 


25.518 


1.365 


3.059 


4.355 


- 0.004 


0.004 


-0.015 


2.89 


3.35 


3.51 



Normal 

n = 100 n = 200 Quantile 



-3.692 


-3.831 


-3.090 


-3.361 


- 3.411 


-2.878 


- 2.845 


- 2.868 


-2.576 


-2.471 


-2.472 


-2.326 


-2.106 


-2.094 


-2.054 


-1.988 


-1.978 


-1.960 


-1.621 


-1.606 


- 1.645 


-1.230 


- 1.224 


- 1.282 


-0.791 


-0.786 


- 0.842 


- 0.487 


CO 

0 

1 


-0.524 


-0.236 


-0.231 


-0.253 


-0.003 


0.003 


0.000 


50.492 


100 . 463 




6.889 


10.699 




- 0.002 


- 0.001 




3.87 


1 — 1 
rH 

-=:r 
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both cases the distributions have heavier tails than in the Gamma case, and 
estimated kurtoses greater than one. The convergence to the asymptotic 

normal distribution is particularly slow for the log-normal case, but in no 
case is the normal approximation too far off at the quantiles corresponding to 
the usual significance levels used in hypothesis testing. Actually division 
of the quantiles by C (X) ( (n-1 )/12) from (l6) rather than by the true standard 
deviation of provides a better normal approximation than does division of 
the quantiles by the true Var (Y^). 

Convergence is of course faster and the normal approximation better for 
the cases not shown here, i.e. for intervals with coefficients of variation 
approaching the value one of the exponential distribution. Note that C^(X) = 
10.0 approximates the values found for the computer data of Table 1. 

6. Distributional results for general point processes . The finding from 
the previous sections was that for renewal sequences the null hypothesis vari- 
ance of is inflated by approximately C^(X) over its value for a Poisson 
process. The approximation is exact for large n. 

However, in both examples cited in Section 2 the intervals between events 
X^ are correlated (see the values in Tables 1 and 2). It turns out that for 

a simple statistic such as Y^ fairly broad results can be obtained for general 
point processes, the modification to the variance of Y^ again being simple to 
compute from the data. Thus a rough test of trend can be performed. 

Details of the derivation will be given elsewhere. For a broad class of 
situations Y^ is asymptotically normally distributed with variance 

(21) var ~ {TrCMX)f^ (0+ ) } , 

where f^(0+) is the initial point on the spectrum of the intervals {X^} of the 
process. Since f^(0+) is related to the Initial point of the spectrum of 
counts, s^(0+), and the asymptotic slope, V '(«>), of the variance time curve, 
var {N },of the point process by the relationship [4, p, 78 ] 

(22) V'(») = TTg^(0+) = f^(0+) , 
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we can write (21) as 



(23) var [Y^j ' (e(X)V' («)} . 

The quantity V'(®) is simple to estimate from the data [^4, pp. 115-120] , 

thereby providing an easy modification for the test statistic Y'. 

n 

For a renewal process, f^(0+) = lA , and (21) reduces to (l6). Poisson 
cluster processes [12, l8] have been used to model the earthquake data of 
Section 2. If the length of the cluster In the cluster process is denoted by 
S, we have 

(24) var [Y^j ~ E(S+1 ) { 1 + C2(S+1)}, 



where C^(S+1) is the coefficient of variation squared of S+1. When there is no 
cluster, i.e. S=0 with probability 1, the result (2^) reduces to that for the 
Poisson process. 

For the earthquake data, which has long and very variable clusters, the 
multiplier of (n-l)/12 in (24) has an estimated value of approximately 49. C. 
Dividing the U values given in column 4 of Table 2 by (49)^=7.0, we obtain a 
test statistic which accepts the hypothesis of no trend in all 6 sections of 
the data. 



7. Conclusions and further work . The recommendation put forward in this 
paper is to test for trend in a point process using the U statistic (2) divided 
by the estimated coefficient of variation C(X) in a renewal process, or an 
estimate of {E(X)v’(«)J^ in (23) for a general point process. 

The test is not proposed as being in any sense optimal, but because it can 
be used without detailed knowledge of the structure of the process it is very 
functional. It would be nearly optimal if the point process were close to a 
Poisson process. 

The power of the test needs to be Investigated so that its utility can be 
assessed relative to other tests, especially for processes which are highly 
overdispersed relative to the Poisson process. Point processes of that type 
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occur in many applications. 



Other tests to be considered could be standard regression tests after a 
log transform or scoring of the Intervals In the data; rank correlation tests 
using, perhaps, exponential scores for the Intervals, and other functionals 
than that given in (8) for measuring the "distance” of from u (see [^, 

Ch. 6]). There are other possibilities explored in a recent thesis by Guillier 

[ 6 ] . 
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